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<^ I Abstract 

ly-v ■ We prove the conjecture by Diaconis and Eriksson (2006) that the Markov degree of the 

Birkhoff model is three. In fact we prove the conjecture in a generahzation of the Birkhoff 
r , I model, where each voter is asked to rank a fixed number, say r, of candidates among all 

ly^ ' candidates. We also give an exhaustive characterization of Markov bases for small r. 

^ '. Keywords and phrases: algebraic statistics, Markov basis, normality of semigroup, ranking 

c^ ! model. 



(N : 1 Preliminaries 
> 

Diaconis and Eriksson [|6l conjectured that the Markov degree of the Birkhoff model is three, 

CN ! i.e., the toric ideal associated with the Birkhoff model is generated by binomials of degree at 

most three. In this paper we give a proof of this conjecture in a generalization of the Birkhoff 

model, where each voter is asked to rank a fixed number of most preferred candidates among all 

cn ■ candidates. Our proof is based on arguments of Jacobson and Matthews [|3 for Latin squares. 

The set of Latin squares is a particular fiber in our setting and our result is also a generalization 

of [|71. See [|3l for terminology of algebraic statistics and toric ideals used in this paper. 

^ I Consider an election, where there are n candidates and A'^ voters. Each voter is asked to give 

r (1 < r < n) preferred candidates and to rank them. For example, let n = 5, r = 3 and let the 

candidates be labeled as a, b, c, d, e. A vote (a, c, d) by a voter means that he/she ranks a first, c 

second and d third. For a positive integer m, denote [m] = {1, . . . , m}. When the candidates are 

labeled as 1, . . . , n, the set of possible votes is 

n\ 

Sn,r = {o- = (cr(l), . . . , cr(r)) I a- : injection from [r] to [n]}, \S „J = -, 

(n - r) ! 

where o-{j) denotes the candidate chosen in the j'-th position in the vote cr = ((t(1), . . . , cr(r)). 
Let i//jk, j e [r], k 6 [n], be positive parameters and define a probability distribution over 5„ , by 



^) = ^t]'I^Mfl, Z=J] Y]^j 



P(o-) = - I I <Am;)' ^ = Zj 1 1 '^J'^^J^ *^^^ 

a-eS„^r y'=l 
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If if/jk is large, then the candidate k is likely to be ranked in the 7-th position. When r = n, this 
model is called the Birkhoff model (flU, [|9l). In this paper we call ([T]) an (n, r)-BirkhofF model. 
The sufficient statistic of the (n, r)-BirkhofF model consists of numbers of times the candidate k 
is ranked in the j-th position, j e [r], k e [n]. We denote the sufficient statistic as (tjk)je[r]M[n]- 

Define a 0-1 matrix A = A„,- of size rn x {nl/(n - r)!), called a configuration matrix for 
the (n, r)-Birkhoff model, whose columns are labeled by cr e 5„,. and rows are labeled by 
(j, k) = (position, candidate), such that the {{j, k), cr)-element of A is one if and only if cr(j) = k. 
For example, for n = 4, r = 3, the configuration matrix A4,3 with labels for its rows and columns 
is displayed as follows. 

m's^^^lxl-^^lmmrJ-rtrl■•-^m^^lr^^^'^^^l^^lr<^^^<^'-^^^l 

(1.1) 1 1 1 1 1 1000000000000000000 

(1.2) 0000001 1 1 1 1 1000000000000 

(1.3) 0000000000001 1 1 1 1 1000000 

(1.4) 0000000000000000001 1 1 1 1 1 

(2.1) 0000001 100001 100001 10000 

(2.2) 110000000000001100001100 (2) 

(2. 3) 001 100001 10000000000001 1 

(2.4) 00001 100001 100001 1000000 

(3.1) 000000001010001010001010 

(3.2) 001010000000100001 100001 

(3.3) 100001 100001000000010100 

(3.4) 010100010100010100000000 

Let x(cr) 6 N = {0, 1, . . . } be the frequency of voters choosing a vote cr 6 S„,,- and let 
jc = {x(cr) I cr e Sn,r} be the vector of frequencies. Then t = An^yX is the vector of sufficient 
statistic. For a given t, Tt = {x ^ nI-^'vI | Ajc = ^} is called the /-fiber. 

Let K be any field and let KWpia), cr 6 5„,,-}] be the polynomial ring in the indeterminates 
pier), cr e 5„ ,-. Similarly let K[{i/fji„ j e [r],k e [n]}] be the polynomial ring in the indetermi- 
nates ij/jk,] e [r],k e [n]. Let 

n„,r : K[{p(o-) I cr e S,J] ^ K[{i^jk,j e [r],ke [n]}] 

be a homomorphism defined by 

r 
nn,r ■ P{0-) ^ fflAiVrO)- 

7=1 

Then the toric ideal Ia = I a,,,, for the {n, r)-Birkhoff model is defined to be the kemel of ;r„ ,-■ 
Elements of the integer kemel kevzAn^r = {z e Z''^'"' | Az = 0} of A,,^ are called moves for A,,,.. 
Note that if a voter ranks r = n-l most preferred candidates, then he/she automatically ranks 
the last candidate. It can be easily seen that the configuration matrix A„„_i for the {n,n - 1)- 
Birkhoff model and the configuration matrix A„ „ for the Birkhoff model have the same number 
of columns and their integer kernels are the same: ker^ A„„_i = kerz A„„. 



2 Main result and its proof 

The main result of this paper is the following theorem. 

Theorem 2.1. For r > 2 and n > 3, the toric ideal I a for the (n, r)-Birkhoff model is generated 
by binomials of degree two and three. 

For r = I or r = n = 2, the toric ideal Ia is trivial. For r >2 and n > 3, any set of generators 
for I A contains a binomial of degree three (see Section 13.21 ). In the terminology of algebraic 
statistics, Theorem 12.11 states that the Markov degree of the (n, r)-Birkhoff model is three for 
r >2 and n >3. 

The rest of this section is devoted to a proof of this theorem. 

We define some notation and terminology for our proof, mainly following [|71. We denote 
candidates either by alphabets a,b,c, . . . or by numbers 1 , . . . , n. The set of n candidates is 
denoted by [n], using numbers. As in Section[Tl each vote, such as (a, c, d), is denoted by a row 
vector. A dataset of N votes is denoted by an A'^ x r matrix P whose entries are candidates. A 
row is a vote and a column is a multiset of candidates at the same position or rank. An example 
of a dataset for n = 5,r = 3 and A'^ = 4, is written as 



P = (Pij) = 



a c d 

e a b 

c d a 

b d c 



where the candidates are labeled as a, b, c, d, e. Although the order of the rows of P are arbitrary, 
this matrix notation is convenient for our proof. A Latin square with N = r = nis a special case 
in our problem. 

When the candidates are denoted by 1 , . . . , n, a dataset P can be regarded as a three-dimensional 
0-1 cuboid C = Cp with N Xrxn cells 

Cp = (Cijk), i £ [N], je[r], ke [n], 

where Cjjk = 1 if and only if Pij = k. Following the terminology of Latin squares, we call Cp the 
orthogonal array representation of P. A 0-1 cuboid C of size N x r x n is an orthogonal array 
representation of an A'' x r dataset P if and only if the row sums (2; c,y^) are either zero or one, 
the vertical sums (2i cijk) arc one. The column sums 



tjk = ^Cijk, j £[r], ke [n] 



are the elements of the sufficient statistic t for the (n, r)-Birkhoff model. In the special case of 
Latin squares all the line sums are one: 1 = YiiCtjk = YijCijk = 'Lk'^ijk, '^i,j,k- In addition to 
"valid" votes, we consider two kinds of invalid votes. 

Improper vote. The first type of invalid vote is an improper vote, which contains an element 
of the form b + c-a, where a,b,c are different. An example of an improper vote for n = 5 

and r = 4 is 

(a,a,b + c - a,d). 



which contains an improper element at the third position. We can interpret this improper 
vote in the orthogonal array representation. Suppose that this vote is the i-th row of P. 
The (i, *, *)-slice of Cp is written as 
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(3) 



Note that the row sums of this {i, *, *)-slice are zero or one and the vertical sums of 
this (z, *, *)-slice are one, as in the case of a valid vote. We require this property for an 
improper vote. This requirement implies the following restriction. If a vote contains an 
improper element b + c - a m the j-th position, then neither b nor c appears in other 
positions and a appears once or twice in other positions. In our proof we consider a 
dataset P, which contains at most one improper vote, with the row slice of Cp given as in 
([3]). As in a dataset containing only valid votes, we further require that the column sums 
tjk = Yii Cijk are nonnegative integers. This implies that if b + c -a appears in (i, j)-element 
of P, then a has to appear in the j-th position in some other row of P. 

In summary, we consider P such that its orthogonal array representation Cp contains one 
-1 and its line sums are the same as datasets consisting of valid votes only. We call such 
a P an improper dataset. We call a dataset P proper if it consists of valid votes only. In 
the proof below, we denote an improper dataset by /. The elements of / are denoted by 
Lij,i£ [N],j & [r]. 

The following example gives an improper dataset / for A'^ = 5, n = 6, r = 4 with the 
candidates labeled as a, b, c, d, e, f: 



f b c d 

a d be 

a e + f — a a b 

d a e c 

c a d f 



Vote with collision. The second type of an invalid vote is a vote containing a candidate twice, 
such as (a,b,b). We say that this vote contains a collision or the candidate b collides in 
this vote. 

We also need to consider collisions for a vote containing an improper element. Let a vote 
contain b + c-a in the 7-th position. We say that b (resp. c) collides in this vote if b (resp. 
c) appears in some position other than j. We say that a collides in this vote if a appears 
three times in positions other than j. Note that in the definition of an improper vote above, 
we did not consider collisions. In the following, when we just refer to an improper vote, 
the vote should not contain a collision. If a vote contains both an improper element and a 
collision, we call it an improper vote with collisions. 



We now consider a basic operation, which we call a swap, for transforming a dataset to 
another. This operation does not alter the sufficient statistic of the dataset. By this operation 
we interchange elements in the same position in two rows of P. For illustration, consider the 
upper- left 2x1 submatrix of P and let pn = a, p2\ =b,a i^b. Then by adding 



to the submatrix, we swap a and b: 



a 
b 

* 



* * 



+ 



b - a 

a - b 





b - a 
a - b 





















b * 
a * 

* * 



(4) 



where elements denoted by * are not changed. Note that a and b may collide after the swap. 
Note also that the addition and the equality in ^ should be understood in the three-dimensional 
orthogonal array representation. However, the two-dimensional display in @i is more conve- 
nient. 

We denote the simple swap in dH) by a <-> Z?. When we want to specify the position (column) 

j of a and b, we denote a <r^ b. When we further want to specify the rows i, i', we denote 

{i,i'} : a <-^ b. 

In dll) we swapped a and b which were already in P. The result was \ ^ h where both 

elements were proper. However we also consider swapping a and ^ in ^ , where a,b,c are 
diff"erent. This operation is written as 



a 




b-a 






+ 




^ 


c 




a-b 





b 
a + c ■ 



(5) 



We call the swap in ^ improper and the swap in (HJ proper. In an improper swap a <r^ b for 
the z-th and z'-th rows at the 7-th column of P, we require that one of a, b is the (z, j)-element P 

or the (z', j)-element of P. Then the notation {z, z'} : a <r^ bh well defined. 

We now consider a sequence of swaps. Consider swapping a and b in two different positions 
J, / in the same z'-th and z'-th rows. In our proof below, we often perform these two swaps 
sequentially, i.e., we swap a and b in the j-th column first and then in the /-th column. We 
denote this operation as 



a <r^ b <r^ a 



or 



[1,1 



: a <-^ b <r^ a 



and call this a double swap. The double swap corresponds to the basic move for no three-factor 

interaction model (cf. [i4J) in the orthogonal array representations of datasets. As an example, a 

1 2 
double swap a <r^ b <r^ a, where the second swap is improper, is written as 

b 



a 


b 


+ 


b 


c 





b 
a 



a 
b 



a 
b 



b 
a 



a 
b + c 



(6) 



More generally, we consider a sequence of m swaps in columns j'l, . . . ,j,n, such that two 
consecutive swaps involve a common candidate, and denote it as 



or indicating the rows as 



7l 72 Jm-l Jm 

ai <-> a2 <-> • • • <-> a„, <-> a„,+i (7) 



{i, i } : ai <-^ a2 <-> • • • <-> a„, <-> a^+i . (8) 



We call O (or dS])) a chain swap of length m (even when ai = a„,+i, i.e., we do not make a 
distinction between a chain and a loop). A chain swap of length one is just a swap. 

Suppose that we perform several chain swaps for the same two rows and ignore the order of 
swaps. Note that an even number of swaps at the same column results in no swap and an odd 
number of swaps results in a single swap. Hence the end result of several chain swaps is a set of 
simultaneous swaps of a subset of columns among the two rows. We call this a swap operation 
for a subset of columns among two rows, or simply a swap operation among two rows. When 
we apply a swap operation to P for a subset J of columns among two rows R = {i, i'} and the 
result is P', we denote the operation by a long double sided arrow: 

P^P\ 

where we omit 7, because it is often cumbersome to specify 7. We use the same notation 
when the datasets are improper, although we also need the condition of compatibility defined in 
Definition 12. 6[ 

We now give a proof of Theorem 12. 1 l in a series of lemmas. Let P and P' be two datasets with 
the same sufficient statistic. Our strategy for a proof is to perform operations to P, involving 
at most three rows of P at each step, to increase the number of the common elements in P and 
P'. In each operation, elements at the same position of the three rows of P are permuted. This 
corresponds to a move of degree at most three. In fact, each operation will be further decom- 
posed into a series of swap operations among two rows, which involve intermediate improper 
datasets. For the i-th row of P and the i'-th row of P' 

(pn,...,Pir), {p'i,^,...,p'i,,), 

let 

y = Vw = \{j I Pij = p'rjW (9) 

be the number of the same candidates in the same positions in these two rows. We call V the 
number of concurrences. \f V = r, then we can remove these two votes from P and P' and 
consider other N - I votes. On the other hand, we will show that, if V < r then we can always 
increase V by a series of operations involving at most three rows of P. The /-th row of P will 
eventually coincide with the /' -th row of P' . Then Theorem 12. H is proved by induction on N. 
Our first lemma concerns resolving collisions. 

Lemma 2.2. Suppose that two rows ofP contain collisions and they do not contain an improper 
element. If each candidate appears at most twice in these two rows, then we can resolve the 
collisions by a swap operation among these two rows. 



Remark 2.3. We can prove this lemma based on the normality of the semigroup generated by 
the configuration matrix A,, ^ such as A4 3 in Q- The normality follows from results in [|8l, [ITOl 
and II2I. We will discuss this point again in Section I4~2l However we give our own proof of 
Lemma [Z2l because we will use similar arguments for improper datasets. Arguments based on 
the normality cannot be applied to improper datasets. 

Proof. We first consider the case that there is only one collision in one of the votes. Let a denote 
the colliding candidate. Relabeling the rows and the positions, without loss of generality, the 
two rows are displayed as 

a a d * ■ ■ • * 

b c * * • • • * 

where b i^ c. We choose one of the two a's arbitrarily, say in the second position, and make a 

2 
swap a <r^ c with the following result: 



a c d * ... * 
b a * * ... * 



(10) 



Since a appears at most twice in these two rows, a does not collide in the second row. However 
c might again collide in the first row, e.g.. 



\ a c d c * ... *. 



We then make a swap for c, which was in the first row from the beginning (in this example 

4 
c <r-> *). If we continue this process, we always have collisions in the first row. If this process 

ends in finite number of steps, then by a chain swap we resolve the collisions of a and subsequent 

collisions due to swaps. We claim that this process indeed ends in finite number of steps. 

Actually we show a stronger result that no candidate appears twice in this process of resolving 

collisions. 

Suppose otherwise. Then there is a candidate, say a, which is swapped twice for the first 
time. We consider two cases a = a and a i^ a. 

Suppose a = a. The process of swaps is displayed as follows: 

Since the collision always occurs in the first row, the candidate a was moved from the second 
row to the first row in the swap 5/_i «-^ a. By ( flOl ) we have c = 5/_i, which contradicts the 
assumption that or = a is the first candidate colliding twice. 

Consider the case a i^ a. The process of swaps is displayed as follows: 

a <-> c <-> 51 <-> • • • «-^ si-i <r-> a <r-> si+i <->•••<-> Sm-i ^ a <r-> ■ ■ ■ . (11) 

Considering the subprocess of dH) which starts from the first a, we can apply the discussion 
for the a = a and confirm that there exists a contradiction. We have shown the lemma for the 
case that there is only one collision. 
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Now suppose that there are m colliding candidates a\,a2,.-., a,„. Each of these candidates 
appear in one of the rows twice. Temporarily, we assign different labels, say a'j, a", I = 2, . . . , m, 
to candidates except for a^, namely, we ignore collisions of ai, ■ ■ ■ ,a,„. Then by the above 
procedure we resolve the collision of a^ and subsequent collisions. When this procedure is 
finished, we restore the labels a^ a" -^ ai, I = 2, . . . , m. Then some collisions of ai, . . . , a,„ 
may have been already resolved, but we do not have any new collisions. Hence by the above 
procedure we decrease the number of collisions. As long as there is a remaining collision, we 
can repeat this procedure and resolve all the collisions. n 

So far we discussed resolving collisions. We now consider resolving an improper element 
by a swap operation among two rows. Let b + c - ahe an improper element in the j-th column 
in an improper dataset /. Since the elements of the sufficient statistic of / are assumed to be 
nonnegative, there is a row of / containing a in the same position as b + c - a. We first consider 
a swap between these two elements. If we make a swap a <r^ b, then b + c - a becomes c and a 
becomes b: 



b + c 
a 



+ 



a 
b 



b 
a 



c 
b 



(12) 



Similarly a <r^ c results in ^ . Note that ^ and ^ are swaps of each other. Hence the 
result of several swaps can be regarded as a single swap a <r^ b or a <r^ c. Although there is an 
ambiguity between a <r^ b or a <r^ c, the result of a swap between these rows at the j-th column 
is either c or K^ . By allowing this ambiguity a swap operation among two rows (for a 
subset of columns) is defined for an improper dataset /. We now have the following lemma. 

Lemma 2.4. Let I be an improper dataset, containing an element b + c-a. By a swap operation 
among two rows, I can be transformed to a proper dataset. 

Proof. Without loss of generality, assume that the first row contains b + c ~ a and the second 
row contains a. We can then make a swap {1,2} : a <r^ b, as in (fT2l) . Here a may collide in the 
first row and b may collide in the second row. However both a and b appear at most twice in 
these two rows. Hence we can now resolve these possible collisions by Lemma [Z2] by a swap 
operation among these two rows. D 

The operation of this lemma is denoted by 

/ ^ P, (13) 

where Ris a set of two rows of /. 

At this point we make the following two definitions. 

Definition 2.5. We call two rows in Lemma |Z4l of the form 



* 


b + c- 


a 


* 


* 


* 


a 




* 


* 



a resolvable pair. Here Zim is an improper row and Zpr is a proper row. A resolvable pair is 
denoted as [iimJpi]- 
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Note that any improper dataset / contains a resolvable pair [ii^, ip^] and R in (fT3l) is the rows 
of a resolvable pair. 



Furthermore we consider a swap between two elements in 
b <r^ d or c <r^ d between these two elements. After b <r^ dwe have 



+ c — a 
d 



,di^a,b,c. We allow 



b + c - a 
d 



d + c - a 
b 



and to 



d + c — a 
b 



we can make further swaps c <r^ b or d <r^ b. The end result of several swaps 



is one of the following three cases 



c + d - a 
b 



b + d - a 
c 



or 



b + c - a 
d 



+ c ■ 
d 



These three cases correspond to single swaps b <r^ d, c <r^ d and to no swap to 

Hence if we allow this ambiguity, a swap operation between an improper row and a proper row 
(for a subset of columns) of an improper dataset /, resulting in another improper dataset /', is 
denoted as 



/< — > r. 



(14) 



By the notation (fT4l) we also consider the case that a swap operation is applied to two proper 
rows of / and /'. We now make the following definition concerning ([14] ). 

Definition 2.6. A swap operation among two rows R = {i, i'} in (IT4l) is compatible with improper 
datasets / and /' if there exists a common resolvable pair [z'im, Zpi] of / and /' such that R n 
{Am, ipr} i" 0, or equivalently \R U {i,^, ip,}\ < 3. 

Lemma 2.7. Let P, P' be two proper datasets with the same sufficient statistic. Choose any 
i-th row from P and any i'-th row from P', which are different, and let V, V < r, in (|9l), be 
the number of concurrences in these two rows. If we allow improper datasets, then V can be 
increased by at most three steps of swap operations among two rows applied to P, such that 1) 
if the resulting data set is improper then its improper row and the i-th row form a resolvable 
pair, and 2) each intermediate swap operation between two consecutive improper datasets is 
compatible with them. 

Proof. Without loss of generality we consider the first rows of P and P'. We consider two 
disjoint cases. 

Case 1 There is a candidate in difi'erent positions in the two rows. 

Let b be the candidate appearing in different positions in two rows. Relabeling the po- 
sitions, without loss of generality, let pn = a, p[^ = b, a ^ b, and pi2 = b. Since P' 
contains b in the first column and the sufficient statistic for P and P' is common, P has to 



contain b in the first column, say p2\ = b. We now perform a double swap a<-^b<r^ato 
P: 



a 



a 
b + c ■ 



where *'s are not changed. By this double swap V is increased. If c = a, the swap results 
in a proper dataset. Otherwise, the swap results in an improper dataset, where [2, 1] forms 

11.2) (1,2) 

a resolvable pair. Therefore, V is increased by a process of the form P <— — > P or P < — > I. 

Case 2 Every candidate appearing twice in the two rows appears in the same position. 

Again let pn = a, p[^ = b, a i^ b. b does not appear in the first row of P and a does not 
appear in the first row of P'. As in Case 1, we can assume /?2i = b. Since the first row of 
P' does not contain a, the total frequency of the candidate a in P' is less than A^. Since the 
sufficient statistic is common, it follows that there is a row of P which does not contain a. 

If the second row does not contain a, we can make a swap a <r^ b among the first two 

{1,2| 

rows and increase V without causing collision. This process is of the form P < — > P. 

If the second row contains a, without loss of generality, we let P22 = a and also assume 
that the third row of P does not contain a. Let ^32 = c i^ a. Since a is chosen in the 
second row and not chosen in the third row and both rows have the same number r of 
candidates, there is a candidate d, who is chosen in the third row but is not chosen in the 
second row. If d is in the position j > 2, then by relabeling of positions we assume that 
7733 = d. Then P looks like 



a * * 

b a * 

d c * 

* * * 

* * * 



. . . * 




. . . * 




* 




* 


or 


. . . * 





a * * 
b a * 
* d{= c) * 



* 



* 



* 



* * 



. . . * 




. . . * 




* 




* 


or 


. . . * 





a * * 

b a * 

* c d 

* * * 

* * * 



We perform a swap {2,3} : a <r^ dto the second column of the second and third rows: 



a * * 

b d * 

d a + c - d * 

* * * 



* 




• • • * 




• • • * 




* 


or 


* 





a * * 

b d * 

* a * 

* * * 

* * * 



* 




* 




* 
* 


or 


* 





a * * 

b d * 

* a + c - d d 

* * * 



After the swap the second row does not contain a. The result is proper if c = J (the middle 
case) and improper if c i^ d. 
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Now we apply a swap {1, 2} : a <-^ ^ for the first column of the first and the second rows 
and increase V. In the case c i^ d, the last swap was performed on an improper dataset, 
but it is compatible with the datasets. Furthermore we can resolve the improper element 
a + c - dhy Lemma I2.4[ since [3, 2] is a resolvable pair. The process in this case is 

|2,3| {1,21 {2,3| |1,2| {2,3) 

summarized as P < — > P < — > P or P < — > / < — > / < — > P. 



This proves the lemma. n 

Lemma 2.8. Let I be an improper dataset and P' be a proper dataset with the same sufficient 
statistic. Choose any i'-th row from P' and choose any resolvable pair [ii^, Zpr] of I. Then 
by at most three swap operations among two rows to I, we can 1) increase the number of 
concurrences Vj ^ ,', or 2) make I proper without changing the i^^-th row of I. Furthermore, 1) 
if the resulting data set is improper then its improper row and the i^^-th row form a resolvable 
pair, and 2) each intermediate swap operation between two consecutive improper datasets is 
compatible with them. 

To prove Lemma 1X81 we need the following two lemmas. 

Lemma 2.9. Let I be an improper dataset with li^^j = b + c - a. Suppose that iif = a where 
i ^ Urn, f ^ j cind the pair ofi\m and i is not a resolvable pair Then, letting Ltj = d i^ a, I can 
be transformed by a swap operation among two rows R = (?'im, i] to another improper dataset /' 
containing the improper z'i, 
c + d - a. 



-th row where C- ■, = a and l'- ■ is either ofb + c-a, b + d - a or 

*im7 'imj 



Proof. Without loss of generality, assume that i[^ = j = I and i = f = 2. Let L12 = e i^ b,c. We 

2 
first make a swap of { 1 , 2} : a <-> e to /: 



b + c - a e * ••• * 
d a * ■ ■ ■ * 

* * * • • • * 



b + c - a 
d 

* 



a 
e 

* 



After the swap, a may collide in the first row and e may collide in the second row. If there is no 
collision, the claim of this lemma is proved. Otherwise, we can resolve these possible collisions 
in the following way. 

We try to resolve the collision of e in the second row as in Lemma |Z2] considering a swap 
process: 

{1,2} : e <r-> si <r-> S2 <r-> ■■ ■ . (15) 

In this process the collisions always occur in the second row. 

Consider the case that d is equal to b or c, say d = b. Since the first and the second row do 
not contain b other than in the first column, b does not collide in (fTSl) . which implies that c does 
not collide in (fT5l) . Since no a is in the second row at the beginning of (fT5l) . a does not collide 



11 



in (|T5I) . Therefore, there is no swap involving the first column in (1151) . which implies that the 
collision of e can be resolved as in Lemma IZ2l 

Consider the case d i^ b,c. The difference of this case from Lemma |Z2] is that the process 

(fT5l) may hit the first column. This happens when d appears in (fT5l) for the fist time as si <-^ d, 
j i^ I, and d = l\j in the first row is swapped down to the second row in the j-th column. Then 
we need to choose b ox c and make the swap d <r^ b ox d <r^ cm the first column. By symmetry, 

without loss of generality, we perform d <r^ c: 



b + c - a 
d 



b + d - a 
c 



This amounts to ignoring b and -a and we look at the improper element b + c - a just as a 
proper element c in resolving the collision of e. We leave b - a intact in the (1, l)-element of 
/ during the sequence in (fTST ). Then just as in Lemma |Z2] it follows that no candidate appears 
twice in (fTSl ). Note that b and -a which were left in the (1 , l)-element cause no trouble, because 
collision occurs always in the second row. Indeed, b causes no trouble because it does not leave 
the first row. a causes no trouble because the second row does not initially contain a and when 
a is swapped from the first row to the second row, then the process in (fT5l) ends at that point. 

After the collision of e is resolved, a may still collide in the first row. Let ji and 72 be the 
labels of rows containing a in the first row other than the first column. To resolve this collision 
we consider the following two swap processes: 

{\,2} : a <r-> si <r-> S2 <r^ ■ ■ ■ , (16) 

{1,2} : a <r-> Si <r-> S2 <r^ ■ ■ ■ , (17) 



where no swap in the j2-th column is involved in (1161) and no swap in the ji-th column is 
involved in (flTI) . Since every candidate in the first and second rows except a appears in at most 
two columns, the common candidate involved both in (fT6l ) and in (flTI ) is a only. Then one of 
(fT6l ) and (IT71 ). say (fT6l ). involves neither b nor c, or involves b and no c. Therefore, ignoring 
c, -a and a in the j2-th column, we see that the swap process (fT6l) ends in finite number of steps 
as in Lemma |2^ D 

Lemma 2.10. Let I be an improper dataset with an improper element ti-^^^j = b + c - a. Let 
Lij = d,i i^ ?'ini, and suppose that d i^ a,b, c. Then I can be transformed to another improper 
dataset r by a swap operation among two rows R = {ii^,i} such that either l'-^^. = b+d-a,L\- = c 
or l'- ■ = c + d - a, l'-- = b. 

Proof. Without loss of generality assume iim = 7 = 1 and i = 2. Then the upper-left 2x1 
submatrix of / is ^ j ~ '^ • Note that [1,2] is not a resolvable pair because d i^ a. 

We begin by considering two swaps oi{l,2] : d <r^ b and {1, 2} : J <-^ c. If {1, 2} : J <-^ ^ is 

applied to /, b may collide in the second row and d may collide in the first row. If { 1 , 2} : J <-^ c 
is applied to /, c may collide in the second row and d may collide in the first row. Considering 
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the resolution of possible collisions in the second row for each swap, the following two swap 
processes are obtained: 



[1,2} : d <r^ b <r^ si <-> 52 <-^ 
[1,2} : J <-> c <-> s'l <-> 4 <-^ 



(18) 
(19) 



Since the number of columns which contains a in the first or second row is at most three, one of 
(fTSl) and (fT9l ), say (fTSl) . contains a at most once. Note that each candidates other than a appears 
in the first and second row at most twice. If (fTST ) does not contain a, we see that (fTSi) ends in 
finite number of steps as in Lemma IT2l If (fTSi) contains one a, the finiteness of (fTSi) is proved 
by applying the similar discussion of Lemma |2!2] for the subprocess of (fTSl) which starts from 
a. 

After resolving the collision of b in the second row, d may still collide in the first row. At 
this point the second row contains at most one a. Consider a swap process 

{1,2}: J^ /; ^ 4' ^ •■• . (20) 

Since b has already been involved in (ITST ). no s" is equal to b. If some s" is c, the chain swap 

{1,2} : d <r^ S" <r-> S2 <r-> • • • <r-> C 

resolves the collisions in the first row. Since a appears in the second row at most once, the 
process (|20] | contains a at most once. If a does not appear in (|20|) . the process does not hit the 
first column and we see that (ITST ) ends in finite number of steps as in Lemma [Z2l If a appears 
in (I20I) . the finiteness of (jTSl) is proved by applying the similar discussion of Lemma IZ2] for the 
subprocess of (ITST i which starts from a. o 



Proof of Lemma IZg] Without loss of generality, let i' = 1, [iim,ipr] = [2, 1], in = a, and t2i 

b + c - a. Then / looks like 

a * • • • * 

b + c - a * • • • * 

^ ^ . . . # 



In the cases below, where a resulting dataset is improper, [2,1] will be a resolvable pair. 

Case 1 p\^ = a. 

In this case in P' and hence in /, the candidate a appears at least once in the first column. 
Therefore a is in the first position in some row ? > 2 in /. Let i = 3. Then the rows [2, 3] 
of I form a revolvable pair and / can be transformed to a proper dataset by Lemma 12.41 

|2,3| 

This corresponds to 2) of the lemma and is summarized as / < — > P. 



Case 2 pjj ^ a, but a appears in the first row of P'. 

Without loss of generality let p\2 = a- Let d = lu- 
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Case 2-1 L22 = a. 

1 2 
We perform the double swap a <-^ (i <-> a to the first two rows 



a 
b + c 

* 



d 
a 

* 





r d 




a 


* 


* 




b + c - 


d 


d 


* 


* 


^ 


* 




* 


* 


* 




* 




* 


* 


* 



This increases Vn. This corresponds to 1) of the lemma and is summarized as 



11,21 

I La I. 



Case 2-2 L22 i^ a- 

Since p\2 = a,a has to appear in the second column of /. Without loss of generality, 
let 632 = a. Let e = (31 and / = ^22- Then P looks like 



a 


d 


* 


* 


b + c — a 


f 


* 


* 


e 


a 


* 


* 


* 


* 


* 


* 



From Lemma |Z9] applied to rows {2, 3}, this case is reduced to Case 2-1. This case 

|2,31 |1,2| 

together with the subsequent operation of Case 2-1 is summarized as / < — > I < — > I. 

Case 3 a does not appear in the first row of P'. 

Let d = p\^, d i^ a. We will change a = l\i to d and increase V. 

If d = b or d = c,we directly go to the Cases 3-1 and 3-2 below. \i d i^ b, c, we need an 
extra step. Let ^31 = d without loss of generality. Then / looks like 



a 
b + c- 
d 

* 



By Lemma [2. 101 applied to rows {2, 3}, we move d to the second row resolving the possible 
collisions. At this point the (2, l)-element of / may he b + d-a or c + d- a. We consider 
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the former case without loss of generaUty. Then / looks like 



a 

b + d 

c 

* 



Now we try to move d to the first row. We further distinguish two cases. 
Case 3-1 d appears in the first row of /. 

1 2 

We apply a double swap a <r-> d <r^ a: 



a 

b + d 

* 



d 
e 

* 



d 


a 




* 


b 


d + e - 


- a 


* 


* 


* 




* 



(21) 



|1,2| {2,3| |1,2| {2,3| 

This case is summarized as / «— ^ / or / «— ^ / <— > /, where / «— ^ / is needed for 
the case d i^ b,c. We do not repeat this comment for the other cases below. 

Case 3-2 d does not appear in the first row of /. 

If a appears only once in the second row in the 7-th column, j > 1 , then we can 

1 {1,2| 

make the swap {1, 2} : a <-> J to make / proper, which is summarized as / < — > P or 

12,31 {1,21 
/ < > / < > P. 

Hence we consider the case that a appears in two columns labeled by /, j^, 1 < / < 
f of the second row of /. Then since P' does not contain a in the first row, / has a 
row not containing a. 

Case 3-2-1 The third row of (ED contains a. 

Without loss of generality, suppose the fourth row of / does not contain a. 
Denote e = 641. Interpreting two a's in the second row as a collision, we try to 
resolve the collision by swapping a at the /-th column down to the third row. 
Then we have a process of swaps 



{2, 4} : fl <-^ 5i <-> 52 <-> • ■ • . 

During the process the collisions occur in the second row. If d appears in this 
process, we discard this process and choose a in the 7^-th column. Then the 
process 



{2, 4} : a <r-> s\ <r^ s'2 <r^ 



(22) 
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does not contain d. Then as in Lemma 12.21 no candidate appears twice in (|22l) 
and (l22l) is a finite chain swap resolving the collisions. 
At this stage / looks like 



a 




* 


* 


b + d- 


-a 


* 


* 


c 




* 


* 


e 




* 


* 


* 




* 


* 





a 




* 


* 




e + d - 


a 


* 


* 




c 




* 


* 


or 


b 




* 


* 




* 




* 


* 




* 




* 


* 



In either case, the swap {1,2} : a <-^ d increases V and makes / proper. The 

{2,4| {1,2| |2,3) |2,4) 

whole process for this case is summarized as / < — > / < — > P or I < — > / < — > 

I l—^P. 

Case 3-2-2 The third row of (1211 does not contain a. 

We can just use the third row of (1211 as the fourth row of the the previous 

{2,4| {2,3) 

case. Hence / < — > / is replaced by / < — > / and this case is summarized as 

T '2-31 , 11.2) 

/ < > / < > P. 



We now summarize what we have proved so far. We will again discuss the following result 
in Section llJ] 

Suppose that P and P' are two proper datasets with the same sufficient statistic. Choose any 
i-th row from P and any i'-th row from P', which are different. If we allow improper datasets, 
then by a sequence of swap operations among two rows of P, we can make the i-th row of P 
identical with the ?'-th row of P. Then we throw away this common row from the two datasets 
and repeat the procedure. It should be noted that P may have been transformed to an improper 
dataset / when two rows coincide, but / contains a resolvable pair [ii^, ip^] with Zpr i^ i. Hence 
we can continue this process until P is fully transformed to P'. 

In order to finish our proof of Theorem 12 . 1 J we have to show that each intermediate improper 
dataset can be temporarily transformed to a proper dataset and the consecutive proper datasets 
are connected by operations on three rows. 

We decompose the whole process of transforming P to P' into segments, whenever there 
appears proper dataset. One segment is depicted as follows: 



-^/i 



-^ /, < — > /, 



;+l 



-4 /„ 



^Pn 



(23) 



where each < — > (omitting R) denotes a swap operation among two rows in Lemma 12.71 and 
Lemma |Z8| By Lemma [277] and Lemma [Z8l the number of concurrences in P,„ is larger than 
in Pi. We claim that for any consecutive improper datasets /,, /,+i, we can find proper datasets 
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P,-,P;,/';^i satisfying 



Pi^^h^^I^^^Pl,, (24) 

P'. ^^ /, ^^ P,. (25) 



The swap operation for /, < — > /;+i is compatible with both datasets. Hence if we chose a 
common resolvable pair for /, and /;+i, then (|24l) for transforming P; to P[ involves three rows. 
On the other hand, since both P\ < — > /, and /, < — > P, involve an improper row, the operation 
of transforming P'. to P, involves three rows. 
This completes the proof of Theorem 12. II 

3 Structure of moves of degree two and three 

To analyze the structure of moves in the Markov basis, it is enough to consider the moves of 
degree two and three because of Theorem |2T| It means that we only need to consider the dataset 
consisting of two or three votes. Then we can analyze the structure of moves by discussing the 
structure of fiber for the sufficient statistic. Details of computational results used in this section 
are available at [HI. 

3.1 Moves of degree two 

We begin by discussing the structure of fibers for datasets which consist of two votes. Consider 
a sequence of multisets consisting of two elements in [n] of the form: 

M = {{ai,a2},...,{a2r~i,a2r}), (26) 

where aj e [n],j = \,...,2r. Each multiset {a2j-i,a2j} corresponds to the multiset of two 
candidates in the j'-th position. This sequence is a possible observation of sufficient statistic 
in the (n, r)-Birkhoff model if and only if each k e [n] appears in the multiset {ai,a2, . . . , air} 
at most twice. For the observation ({ai, ai), . . . , {^2,-1, «2r}) of the sufficient statistic, define a 
graph Gm on the vertex set [r] as follows: for each j, / € [r], j ^ /, an edge {j, /} of Gm exists 
if and only if {a2j-i,a2j} n {a2/-i,a2/} i" 0. We call the multiset {a2j^\,a2j} the j'-th block for 
7e[r]. 

For an isolated vertex in Gm the corresponding block has the form either {k, k} or {k, k'} for 
some k, k' e [n],k + k' . In the former case there is no necessity to distinguish two votes by this 
block. In the latter case the votes might be distinguished by this block. Since every non-isolated 
vertex is contained by at most two edges, each connected components of Gm is a chain or a cycle 
if it consists of more than one vertex. Then the candidates are uniquely assigned to two votes 
as a subset of the votes. Let L be the number of connected components of Gm brushing aside 
those of the form \k, k\ for some fc € \n\. The number of elements of the corresponding fiber is 
2^ ^ Especially, the move arising from the corresponding fiber is indispensable if and only if 
L = 2. 
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We evaluate the number of moves of degree two in a minimal Markov basis. For the case 
r = 2 the number of moves of degree two is 6rA. For the case r = 3, the number of moves of 
degree two is 

Consider the case r = 4. The number of moves of degree two for the partition (3, 1) of four 

is 

4\(n\ 6! (4\(n\ 5! (n\ (n\ 

(2-1)+ jL „ (2 -1) = 240 1 + 1440 1. 



P/\6/l!l!l!l!2!' ' \3/\5/ 1!1!1!2!' ' \5) \6 

The number of moves of degree two for the partition (2, 2) of four is 



2!\2/\6/l!l!l!l!l!l!' \2/\5/ 1!1!1!2!' ' 2!\2/\4/2!2! 

The number of moves of degree two for the partition (2, 1 , 1) of four is 



2;\7/l!l!l!2!2! 2!1!1!\6/ 1!1!1!2!1! \2/\6/2!2!2! 

4! (n\ 5! 
^2!1!1!(5J2!2!1!*' ~ ^'' 

The number of moves of degree two for the partition (1 , 1, 1, 1) of four is 
8! (4\(n\ 7! (4\(n\ 6! 

'^^2!2!2!2!^^ " '^ " i3)i7)2!2!2!TT^^ " '^ ^ (2)(6)2!2!mT^' " '^ 

= losor I + 756or I + iv64or 

Then the number of moves of degree two for r = 4 is 

18(^) + 960Q + 10620(^) + 30240(;) + 1764o(^ 

By the similar calculation we obtain the following polynomial which represents the number 
of moves of degree two for the case r = 5: 

1050r I + 40050r j + 485100r | + 2444400r j + 3969000r j + 1701000J " 

The number of moves of degree two in minimal Markov bases are summarized as Table \T\ 
The authors confirmed that the numbers above the horizontal lines in Table [T] coincide with the 
numbers obtained by the output of the computational software 4ti2([|2l). 
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Table 1 : Number of moves of degree two. 



\. r 
n \^ 


2 


3 


4 


5 


1 














2 














3 














4 


6 


18 


18 





5 
6 


30 
90 


360 
2160 


1050 


1050 


16650 


46350 


7 


210 


8190 


125370 


787500 


8 
9 


420 
756 


23940 


611940 
2262708 


7505400 
46928700 


58968 


10 


1260 


128520 


6898500 


218276100 



3.2 Moves of degree three 

The structure of fibers for datasets which consists of three votes is more complicated. Let 

M = {{aubuC\},...,{ar,br,Cr\) 

be the observed sufficient statistic where Qj, bj, Cj e [n],j = I,. . .,r and each k G [n\ appears 
in the multiset {ai, . . .,ar,bi, . . .,br,ci, . . ., Cr} at most three times. Similarly to the case of two 
votes, a graph Gm on [r] can be defined: an edge {j, /} of Gm exists if and only if {aj, bj, Cj] n 
{af,bf,Cj'} 4^ 0. For example, consider the case n = 6, r = 3 and let the set of candidates be 
{a,b,c,d,e,f]. Let 

M = ({a, a, b}, {c, c, d}, {d, e, /}) 

be the observed sufficient statistic. The vertex set of Gm is [3] and the connected components 
are {1} and {2, 3}. The possible assignment in the first connected component is {(a), (a), (b)}. In 
the second connected component there are two kinds of assignments, {(c,d),(c,e),(d,f)} and 
{(c, d), (c, /), (d, e)}. In this case the number of elements of the corresponding fiber is six. 

Now we discuss the detailed structure of fibers arising from the sufficient statistic M = 
({ai,bi,ci}, ..., [ar, b,-, Cr}) such that the associated graph Gm is connected. Thanks to the sym- 
metry in permutation of ranking orders and of labels of the candidates, we consider the equiva- 
lence classes of such sufficient statistics. Figures [l]-|7] show the graph G^'s for all the represen- 
tatives of the equivalence classes for r = 2,3 whose corresponding fiber needs a move of degree 
three for its connectivity. The moves of degree three arising from these figures except Figure |6] 
are indispensable. On the other hand, to guarantee the connectivity of the fiber associated with 
Figure m the Markov basis needs to include a dispensable move of degree three. 

Let us consider the case r = 4. There are 241 different equivalence classes of the sufficient 
statistic M's with connected Gm- For 38 classes among them, the corresponding fibers need 
moves of degree three for their connectivity. Table |2]summarizes the structure of the equivalence 
classes. In this table, 38 equivalence classes are classified by the associated graph Gm, the 
number hm of candidates appearing in M, and whether a move of degree three needed for the 
connectivity of the corresponding fiber is indispensable or not. Figure [S] shows an example of a 
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Figure 1: 






Figure 2: 



Figure 3: 



Figure 4: 






Figure 5: 



Figure 6: 



Figure 7: 



fiber connected by an indispensable move of degree three. On the other hand, the fiber in Figure 
[9] needs a dispensable move of degree three for its connectivity. 

The rest of this section is devoted to the evaluation of the number of moves of degree three 
in a minimal Markov basis. We first evaluate the sizes of equivalence classes of the sufficient 
statistic M's with connected Gm- For the fiber T'm associated with a given sufficient statistic M, 
let ^5^„ be a graph on the vertex set "Fm defined as follows: an edge {x,y} for x,y e Tm exists 
if and only if x and y are connected by a move of degree two. Table |3] counts the sufficient 
statistic M's in each equivalence classes classified by the length r of ranking, the number hm of 
the candidates appearing in M, and the number A^^ of connected components of the graph Qr^^. 

Using Table [31 the number of moves of degree three in a minimal Markov basis can be 
calculated. To illustrate the process of this calculation we define some notations. Let \_M]r',n',N' 
be the equivalence class whose length of ranking is r', the number of candidates is n' , and 
the number of connected component of ^^^^ is N' . Let nr>^n',N' = nM,M e [M],-'„> n'. Let 
Nr',n',N' = Nm,M g [M]r' ,n' ,n' ■ For cxamplc, ^2,3,1 = 3 and A^2,3,i = 1- Denote the size of 
equivalence class {M]r,n',N' by #\r,n' ,N']. For example, #[1,1,1] = 1 and #[2,4,1] = 60. 
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Table 2: Classification of the equivalence classes for r = 4. 



Gm 


flM 


indispensability 


# of equiv. classes 
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* 1 
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yes 


2 
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yes 
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1 1 
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no 


2 


6 


yes 
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no 


6 




7 


no 


2 






5 


yes 


7 




/ 


1 


no 


1 


6 


no 


5 




7 


no 


1 




4 


yes 


1 
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1 


5 


yes 


2 


no 


4 






6 


yes 


1 








no 
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Furthermore, for simplicity, we set 

















ceab 




cdab 




dbca 


deba 


edba 


ebca 








s 




ceba 




cdba 


I abc 1 




dbca 


deab 


edab 


ebca 


J j-T"^ 1 










(^bde 
( cde J 


) 



Figure 9: 



I[n,r',n',N'] = \Mr',n',N']. 
Consider the case r = 2. The number of moves of degree three is 



I[n, 2, 3, 2](A^2,3,2 - 1) = r j X 1 X (2 - 1) = r 



21 



Table 3: Sizes of equivalence classes of sufficient statistics. 



r 


nM 


Nm 


size of equiv. class 


r 


«M 


A^M 


size of equiv. class 


1 


1 


1 


1 


3 


6 


1 


8820 


2 


1 


2 


7 


1 


4410 


3 


1 


1 


4 


4 


1 


1128 


2 


2 


1 


2 


2 


144 


3 


1 


30 


5 


1 


82080 


2 


1 


2 


23040 


4 


1 


60 


4 


600 


5 


1 


30 


6 


1 


885240 


3 


3 


1 


30 


2 


60480 


2 


1 


7 


1 


2847600 


4 


1 


1128 


2 


37800 


2 


144 


8 


1 


3749760 


5 


1 


5760 


9 


1 


1814400 


2 


150 











Consider the case r = 3. The number of moves of degree three for the partition (3) of three 



IS 



I[n, 3, 3, 2](A^3,3,2 - 1) + I[n, 3, 4, 2](A^3,4,2 - D + I[n, 3, 5, 2](A^3,5,2 - 1) 

The number of moves of degree three for the partition (2, 1) of three is 
PW, 2, 3, 2] X (l[n - n2,3,2, 1,1, l](A^2,3.2M,i,i - D 

+I[n - ^2,3,2, 1,2, l](A^2,3,2M,2,l - 1) + /[« - ^2,3,2, 1, 3, 1](A^2,3,2M.3,1 ' 1)) 

Then the number of moves of degree three for r = 3 is 



IS 



lh<h<< 



Consider the case r = 4. The number of moves of degree three for the partition (4) of four 

/[n, 4, 4, 2](A^4,4,2 - 1) + /[«, 4, 5, 2](A^4,5,2 - 1) + /[", 4, 5, 4](A^4,5,4 - 1) 
+I[n, 4, 6, 2](A^4,6,2 - 1) + I[n, 4, 7, 2](A^4,7,2 - D 

= 144r I + 24840J^| + 60480r j + 37800r 
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The number of moves of degree three for the partition (3 , 1) of four is 
l^\(l[n,3,3,2] X (l[n - ^3,3,2, 1, 1, l](A^3,3,2M,i,i - D 

+I[n - n3,3,2, 1,2, l](A^3,3,2M,2,i - 1) + I[n - ^3,3,2, 1,3, l](A^3,3,2M,3,i - 1)) 
+I[n, 3, 4, 2] X (l[n - ^3,4,2, 1,1, l](A^3,4,2M,u - 1) 

+I[n - n^,A,2, 1,2, l](A^3,4,2M,2,i - 1) + I\.n - ^3,4,2, 1,3, l](A^3,4,2M,3,i - 1)) 
+/[n, 3, 5, 2] X [l[n - ^3,5,2, 1,1, l](A^3,5,2M,i,i - D 

+I\n - ^3,5,2, 1,2, l](A^3,5,2M,2,l - 1) + /[« " «3,5,2, 1,3, 1](A^3,5,2M,3,1 ' 1))) 

= 16J J + 2960J"| + 20960J" j + 45360r j + 33600r j. 
The number of moves of degree three for the partition (2,2) of four is 
if^kn, 2, 3, 2] X (l[n - ^2,3,2, 2, 2, l](iV2,3,2A^2,2,i - 1) 

+/[n - ^2,3,2, 2, 3, l](A^2,3,2A^2,3,l - 1) + /[« - ^2,3,2, 2, 3, 2](A^2,3,2A^2,3,2 ' 1) 
+/[n - 712,3,2, 2, 4, 1](A^2,3,2A^2,4,1 " 1) + /[/^ " «2,3,2, 2, 5, 1](A^2,3,2A^2,5,1 " 1)) 

= i2or I + 378or I + i260or I + loosor j. 

The number of moves of degree three for the partition (2, 1, 1) of four is 

1 4! 

/[n,2,3,2] 

2!2!1!1! '-'''■' 

x(/[n - 772,3,2, 1,1, l](/[n - 7J2,3,2 ' ^^l,!,!, 1, 1, l](^2,3,2M,l,lM,l,l ' 1) 

+I[n - 772,3,2 - "1,1,1, 1,2, l](A^2,3,2M,l,lM,2,l - 1) 

+I[n - 7J2,3,2 - "1,1,1, 1,3, l](A^2,3,2M,l,lM,3,l " 1)) 

+I{n - 712,3,2, 1, 2, l](/[72 - 712,3,2 " "l,2,l, 1,1, l](A^2,3,2M,2,lM,l,l " 1) 

+I{n - 712,3,2 - "1,2,1, 1, 2, l](A^2,3,2M,2,lM,2,l " 1) 

+/[7Z - 7J2,3,2 - "1,2,1, 1,3, l](A^2,3,2M,2,lM,3,l " 1)) 

+/[" - "2,3,2, 1,3, l](/[" - "2,3,2 - "1,3,1, 1,1, l](A^2,3,2M,3,lM,l,l ' 1) 

+/[" - "2,3,2 - "1,3,1, 1, 2, l](A^2,3,2M,3,lM,2,l - 1) 

+/[" - "2,3,2 - "1,3,1, 1,3, l](A^2,3,2M,3,lM,3,l ' 1))) 

= i2or| + i44or| + 6720r| + 13440r| + 10080r 
Then the number of moves of degree three for r = 4 is 

160J J + 28040r j + 86660J^j + 102480J J + 57120r j + lOOSOJ" 
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Table 4: Number of moves of degree three. 



\, r 
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160 


160 





5 
6 


10 
20 


1000 
3680 


28840 


28840 


257300 


7056240 


7 


35 


10325 


1303540 


84797440 


8 
9 


56 

84 


24416 


4884880 
15046080 


565736640 
2735910240 


51240 


10 


120 


98400 


40267080 


10678207680 



By the similar calculation we obtain the following polynomial which represents the number 
of moves of degree two for the case r = 5: 



28840r j + 6883200r j + 36009400r j + 83316800J"j + 1078980001 " 



+761040001 



(;).27720000(;,) + 3696000(;). 



The number of moves of degree three in minimal Markov bases are summarized as Table |4] 
The authors confirmed that the numbers above the horizontal lines in Table |4] coincides with the 
numbers obtained by the output of the computational software 4ti2(|l2l|). 

4 Some discussions 

In this section we discuss some topics related to our main result. 

4.1 Extension of fibers by allowing one negative element 



As discussed after the proof of Lemma |Z8l we have shown the following result by our proof of 
Theorem |2](cf. dH). 

Proposition 4.1. Let P and P' be any two proper datasets with the same sufficient statistic. If 
we allow incomplete datasets, P and P' are connected by swap operations among two rows at 
each step. 

Note an improper dataset has one -1 in its orthogonal array representation. Then Proposi- 
tion 14.11 seems to suggest that every fiber Tt for the configuration A„ ,. becomes connected by 
degree two moves if we extend T^t by allowing one negative element x(cr) = - 1 in jc which sat- 
isfies t = Ax. However this is not correct. In fact allowing -1 in the orthogonal representation 
and allowing -1 mTt are two different things. 
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This can be confirmed by the following basic example. For n = 3 and r = 2 with candidates 
labeled as a,b,c, it is easily seen that dimkerA3 2 = 1 and IA32 is a principal ideal generated 
by a single binomial p(ab)p(bc)p(ca) - p{ac){cb)p{ba). Hence there is no degree two move in 
kerz A3 2- Yet, we can connect two datasets 



P = 



by applying {2, 3} : a <-> Z?, {1, 2} : Z? <-^ c and {2, 3} : a <-> c in this order: 



a 


b 
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P' = 
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a 
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a 
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b 




a 




c 
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-^ 


a 
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-^ 
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-^ 
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a 




b + c - 


- a 


a 




b + c - 


- a 


a 




b 


a 



Note that the middle two datasets can be interpreted either as 

adding {ab), {ac), {be), (ca) and subtracting (ac) 



or as 

adding (ab), (ac), (ba), (cb) and subtracting (ab). 

However the middle two datasets do not correspond to an element of a fiber for A3 2. 

4.2 Normality 

Here we discuss the normality of the semigroup generated by A„ r and its relation to Lemma 

Each column of the configuration matrix A,,,- such as A4 3 in Q can be considered as the 
stacked form of an n x r 0- 1 matrix. Consider the set 2 of n x r real matrices X = {xij} satisfying 



< Xij < 1, Vi, j, Yj ^'J ^ 1' ^'' 2 

j=i i=i 



Xij = 1,V/ 



2 is a polytope in R."^''. By |l5l the set of vertices of Q is exactly the same as the set of columns 
of A,,,-. Then by the results of [[HI and [[TOl the semigroup generated by A,,,,- is normal. Lemma 
12.21 is a consequence of this normality, because by the normality each fiber indexed by M in 
(|26|) has a solution consisting of two valid votes. These two valid votes can be obtained from 
the two rows of a swap operation in Lemma IZ2l 

However the normality is not useful in proving Lemmas l2.9l and |2. 101 



4.3 Generation of moves for running a Markov chain 

Our detailed investigation of Markov basis for r < 5 suggests that it will be difficult to obtain 
an exhaustive list of various types of elements of Markov bases for r > 6. However based 
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on Theorem 12. II we can run a Markov chain for general r as follows. We randomly generate 
two or three valid votes of r candidates out of n candidates. Once these votes are obtained, 
we randomly perform permutations of candidates in the same position. We do this for each 
position. If no collision occurs, then we have two or three valid votes. If the obtained set of 
votes is different from the initial set, then the difference is a move. In this way, we obtain a 
random move of degree two or three and then run a Markov chain over a given fiber. 
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